NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Harnessing the Computing Continuum for Urgent Science

https://doi.org/10.1145/3439602.3439618

Balouek-Thomert, Daniel; Rodero, Ivan; Parashar, Manish (November 2020, ACM SIGMETRICS Performance Evaluation Review)

Full Text Available
An edge-aware autonomic runtime for data streaming and in-transit processing

https://doi.org/10.1016/j.future.2020.03.037

Zamani, Ali Reza; Balouek-Thomert, Daniel; Villalobos, J.J.; Rodero, Ivan; Parashar, Manish (September 2020, Future Generation Computer Systems)
null (Ed.)
Full Text Available
A Distributed Multi-Sensor Machine Learning Approach to Earthquake Early Warning

https://doi.org/10.1609/aaai.v34i01.5376

Fauvel, Kevin; Balouek-Thomert, Daniel; Melgar, Diego; Silva, Pedro; Simonet, Anthony; Antoniu, Gabriel; Costan, Alexandru; Masson, Véronique; Parashar, Manish; Rodero, Ivan; et al (June 2020, Proceedings of the AAAI Conference on Artificial Intelligence)

Our research aims to improve the accuracy of Earthquake Early Warning (EEW) systems by means of machine learning. EEW systems are designed to detect and characterize medium and large earthquakes before their damaging effects reach a certain location. Traditional EEW methods based on seismometers fail to accurately identify large earthquakes due to their sensitivity to the ground motion velocity. The recently introduced high-precision GPS stations, on the other hand, are ineffective to identify medium earthquakes due to its propensity to produce noisy data. In addition, GPS stations and seismometers may be deployed in large numbers across different locations and may produce a significant volume of data consequently, affecting the response time and the robustness of EEW systems.In practice, EEW can be seen as a typical classification problem in the machine learning field: multi-sensor data are given in input, and earthquake severity is the classification result. In this paper, we introduce the Distributed Multi-Sensor Earthquake Early Warning (DMSEEW) system, a novel machine learning-based approach that combines data from both types of sensors (GPS stations and seismometers) to detect medium and large earthquakes. DMSEEW is based on a new stacking ensemble method which has been evaluated on a real-world dataset validated with geoscientists. The system builds on a geographically distributed infrastructure, ensuring an efficient computation in terms of response time and robustness to partial infrastructure failures. Our experiments show that DMSEEW is more accurate than the traditional seismometer-only approach and the combined-sensors (GPS and seismometers) approach that adopts the rule of relative strength.
more » « less
Full Text Available
Runtime Management of Data Quality for Scientific Observatories Using Edge and In-Transit Resources

https://doi.org/10.1109/SBAC-PAD.2018.00053

Zamani, Ali Reza; Balouek-Thomert, Daniel; Villalobos, J. J.; Rodero, Ivan; Parashar, Manish (January 2018, 2018 30th International Symposium on Computer Architecture and High Performance Computing)

Full Text Available
Supporting Data-Driven Workflows Enabled by Large Scale Observatories

https://doi.org/10.1109/eScience.2017.95

Zamani, Ali Reza; AbdelBaky, Moustafa; Balouek-Thomert, Daniel; Rodero, Ivan; Parashar, Manish (October 2017, 2017 IEEE 13th International Conference on e-Science (e-Science))

Large scale observatories are shared-use resources that provide open access to data from geographically distributed sensors and instruments. This data has the potential to accelerate scientific discovery. However, seamlessly integrating the data into scientific workflows remains a challenge. In this paper, we summarize our ongoing work in supporting data-driven and data-intensive workflows and outline our vision for how these observatories can improve large-scale science. Specifically, we present programming abstractions and runtime management services to enable the automatic integration of data in scientific workflows. Further, we show how approximation techniques can be used to address network and processing variations by studying constraint limitations and their associated latencies. We use the Ocean Observatories Initiative (OOI) as a driving use case for this work.
more » « less
Full Text Available
Submarine: A Subscription-based Data Streaming Framework for Integrating Large Facilities and Advanced Cyberinfrastructure

Zamani, Ali Reza; AbdelBaky, Moustafa; Balouek-Thomert, Daniel; Villalobos, J. J.; Rodero, Ivan; Parashar, Manish (January 2017, The Eighth International Workshop on Data-Intensive Computing in the Clouds)

Full Text Available
Submarine: A subscription‐based data streaming framework for integrating large facilities and advanced cyberinfrastructure

https://doi.org/10.1002/cpe.5256

Zamani, Ali Reza; AbdelBaky, Moustafa; Balouek‐Thomert, Daniel; Villalobos, J. J.; Rodero, Ivan; Parashar, Manish (April 2019, Concurrency and Computation: Practice and Experience)

Summary Large scientific facilities provide researchers with instrumentation, data, and data products that can accelerate scientific discovery. However, increasing data volumes coupled with limited local computational power prevents researchers from taking full advantage of what these facilities can offer. Many researchers looked into using commercial and academic cyberinfrastructure (CI) to process these data. Nevertheless, there remains a disconnect between large facilities and CI that requires researchers to be actively part of the data processing cycle. The increasing complexity of CI and data scale necessitates new data delivery models, those that can autonomously integrate large‐scale scientific facilities and CI to deliver real‐time data and insights. In this paper, we present our initial efforts using the Ocean Observatories Initiative project as a use case. In particular, we present a subscription‐based data streaming service for data delivery that leverages the Apache Kafka data streaming platform. We also show how our solution can automatically integrate large‐scale facilities with CI services for automated data processing.
more » « less

Search for: All records